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Abstract. In the companion paper [7] , we measured homology classes and computed the optimal 
homology basis. This paper addresses two related problems, namely, localization and stability. 
We localize a class with the cycle minimizing a certain objective function. We explore three 
different objective functions, namely, volume, diameter and radius. We show that it is NP-hard 
to compute the smallest cycle using the former two. We also prove that the measurement defined 
in [7] is stable with regard to small changes of the geometry of the concerned space. 

1. Introduction 

The problem of computing the topological features of a space has recently drawn much at- 
tention from researchers in various fields, such as high-dimensional data analysis [3, 14], graphics 
[12, 5], networks [9] and computational biology [1, 8]. Topological features are often preferable to 
purely geometric features, as they are more qualitative and global, and tend to be more robust. If 
the goal is to characterize a space, therefore, features which incorporate topology seem to be good 
candidates. In this paper, the topological features we use are homology groups over Z2, due to 
their ease of computation. (Thus, throughout this paper, all the additions are mod 2 additions.) 

In the companion paper [7] , we addressed the problems of measuring homology classes and 
finding a concise representation (a homology basis) of the homology group. We defined a size of 
a homology class as the radius of the smallest geodesic ball carrying the class, using terminology 
from relative homology. We also computed the optimal homology basis which is the homology 
basis whose elements' size have the minimal sum. 

This paper addresses two problems that are natural byproducts of the ideas in [7] . 
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Problem 1: Localization. In the companion paper [7], we localized each class in the optimal 
homology basis with its localized- cycle, which is the representative cycle carried by the smallest 
geodesic ball carrying the class. That was necessary for the algorithm to work, but was otherwise 
not an object of interest. However, the use of localized-cycle suggests an interesting problem: 
given a natural criterion of the size of a cycle, can we find the smallest cycle in a class? We will 
see that the companion paper implicitly provided one such criterion, but there are other criteria 
as well. 

The criterion should be deliberately chosen so that the corresponding smallest cycle is concise 
in not only mathematics but also intuition. Such a cycle is a "well-localized" representative cycle 
of its class. For example, in Figure 1, the cycles z\ and Z2 are well-localized representatives of 
their respective homology classes; whereas zs is not. This problem of localizing classes with well- 




Figure 1: Left: the concerned space is a disk with three holes. 

Right: cycles z\ and z 2 are well-localized; z 3 is not. 

localized cycles has potential applications in various field such as high-dimensional data analysis 
[3, 14], graphics [19, 12, 5], CAD [11], shape study [4], etc. A review of approaches of this problem 
will be given later in this section. 

In this paper, we will look at three such criteria, i.e. the volume, diameter and radius of a 
cycle. We will show that computing the smallest cycle using the former two are NP-hard, whereas 
it is polynomial to compute using the latter, which corresponds naturally to the homology class 
size measure used in the companion paper. 

Problem 2: Stability. We also address another open question suggested by the companion 
paper. The size measure of homology classes and the optimal homology basis defined in that 
paper depend on the (discrete) metric on the simplicial complex under consideration. A natural 
question is: is the measure and the basis stable with respect to small changes in metric? In 
Section 3 of this paper, we show that under a suitable definition of stability, the answer is yes. 
This provides theoretical justification for our work in the companion paper [7]. 

Restrictions. Furthermore, as in the companion paper, we make two additional requirements 
on the solution of the aforementioned problem. First, the solution ought to be computable 
for topological spaces of arbitrary dimension. Second the solution should not require that the 
topological space be embedded, for example in a Euclidean space; and if the space is embedded, 
the solution should not make use of the embedding. These requirements are natural from the 
theoretical point of view, but may also be justified based on the following applications: 

• In machine learning, it is often assumed that the data lives in a manifold whose dimension 
is much smaller than the dimension of the embedding space. 
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• In the study of shape, it is common to enrich the shape with other quantities, such as 
curvature, or color and other physical quantities. This leads to high dimensional manifolds 
(e.g, 5-7 dimensions) embedded in high dimensional ambient spaces [4] . 

Related Works. Using Dijkstra's shortest path algorithm, Erickson and Whittlesey [13] local- 
ized a 1-dimensional homology class with its shortest cycle. They defined the shortest homology 
basis as the set of (3 linearly independent homology classes, such that lengths of their shortest 
representative cycles have the minimal sum. They provided a greedy algorithm to localize classes 
in this basis with their shortest cycles. 1 The authors also showed how the idea carries over to 
finding the optimal generators of the first fundamental group, though the proof is considerably 
harder in this case. For completeness, we refer to some related works which compute a single or a 
set of non-trivial cycles satisfying certain topological and geometrical restrictions on 2-manifolds 
[12, 16, 6, 10]. 

Some researchers concentrate on 1-dimensional cycles closely related to handles which are 
much more meaningful in low dimensional applications such as graphics and CAD. Given a 2- 
manifold embedded in S 3 , Dey et al. [11] computed these handle-related cycles by computing the 
deformation retractions of the two components of the the embedding space bounded by the given 
2-manifold. Their work facilitates handle detection in real applications. However, the computed 
1-cycles are not guaranteed to be geometrically concise. Guskov and Wood [15, 19] detected small 
handles of a 2-manifold using the Reeb graph of the manifold. 

All of the aforementioned works are restricted to low- dimensional manifolds. Zomorodian and 
Carlsson [20] took a different approach to solving the localization problem for general dimension. 
Their method starts with a topological space and a cover, which is a set of spaces whose union 
contains the original space. They computed a homology basis and localized classes of it, using tools 
from algebraic topology and persistent homology. However, both the quality of the localization 
and the complexity of the algorithm depend strongly on the choice of cover; there is, as yet, no 
suggestion of a canonical cover. 

Contributions. We explore the localization problem using different size definitions of a cycle, 
namely, the volume, diameter and radius. 

• We prove that it is NP-hard to localize a class with its smallest cycle in terms of the 
volume and the diameter, by reduction from MAX-2SAT-B and MCCP, respectively. 

• The cycle with the minimal radius is literally the localized-cycle as defined in the com- 
panion paper. We show that although it may not be perfectly well-localized, it is a 
2-approximation of the cycle with the minimal diameter. A polynomial algorithm to 
compute this cycle is provided. 2 

For the stability problem, we prove that (1) the size of homology classes and (2) the subgroup 
filtration computed from the optimal homology basis are both stable with regard to small changes 
of the geometry of the concerned space. 

2. Localization 

In this section, we address the localization problem. For ease of computation, we restrict 
our work to homology groups over Z2 field. Throughout this paper, all the additions are mod 
2 additions. When we talk about a d-dimensional chain, c, we refer to either a collection of 

"^Note that their polynomial algorithm can only localize classes in the shortest homology basis. In fact, we will 
show in Section 2.1 that it is NP-hard to localize an arbitrary given class with the shortest cycle. 

In the companion paper, a polynomial algorithm is provided to compute localized-cycles for classes in the optimal 
homology class. In this paper, we extend it to localizing any given class. 
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d-simplices, or a n^-dimensional vector over Z2 field, whose non-zero entries corresponds to the 
included (i-simplices. Here is the total number of (i-simplices in the given complex. The relevant 
background in homology can be found in [17]. 

We formalize the localization problem as a combinatorial optimization problem: Given a 
simplcial complex K, compute the representative cycle of a given homology class minimizing 
a certain objective function. Formally, given an objective function defined on all the cycles, 
cost : Zd(K) — > R, we want to localize a given class with its optimally localized cycle, 

z pt(h) = argmincost(z). 

In general, we assume the class h is given by one of its representative cycles, Zq. 

In this paper, we explore three options of the objective function cost(z), i.e. the volume, 
diameter and radius, in the following three subsections. 

2.1. Volume 

The first choice of the objective function is the volume. 
Definition 2.1 (Volume). The volume of a cycle is the number of its simplices, vol(z) = card(z). 

For example, the volume of a 1-dimensional cycle, a 2-dimensional cycle and a 3-dimensional 
cycle are the numbers of their edges, triangles and tetrahedra, respectively. A cycle with the small- 
est volume, denoted consistent to a "well-localized" cycle in intuition. Its 1-dimensional 
version, the shortest cycle of a class, has been studied by researchers [13, 19, 11]. However, we 
prove in Theorem 2.3 that computing z v of h is NP-hard. 3 

More generally, we can extend the the volume to be the sum of the weights assigned to 
simplices of the cycle, given an arbitrary weight function, w : K — ► R, defined on all the simplices 
of K, formally, 

vol'(z) = ^2 w (°~)- 

cr£z 

Theorem 2.3 implies that computing z v using this general volume definition is still NP-hard, 
because Definition 2.1 is in fact a special case of this general definition (when w(o~) = 1, Vct 6 K). 
We prove by theorem 2.3 by reduction from the NP-hard problem MAX-2SAT-B [18]. 

Problem 2.2 (MAX-2SAT-B). Given N literals, x\ to x N , and M different clauses, d to c M , 
each in the form of (xj V Xj), (->Xi V xj), (xi V ->Xj) or V ~*Xj). Each literal appears in at most 
B clauses. Find an assignment assigning boolean values to all the literals such that the number 
of satisfied clauses are maximized. 

Theorem 2.3. Computing z v for a given h is NP-hard. 

Proof. Given an input of MAX-2SAT-B, we construct in polynomial time an input of the problem 
of computing z v for a given h, denoted as the Opt- Vol problem for short. We will later show 
that the answer of Opt- Vol gives us the answer of MAX-2SAT-B. The input simplcial complex 
for Opt- Vol consists of N literal tubes and M 1-dimensional clause cycles. 

(1) Literal Tubes. For each literal X{, we construct a literal tube Tj. Denote hi as the 1- 
dimensional class carried by Tj. Tj is made fat and long yet with two short end cycles, such 
that a representative cycle of hi with no more than L length can only be one of the two 
end cycles of the tube. (The parameter L will be given later.) See Figure 2 (Left) for an 
example, technical details of the construction are omitted because of the space limitation. 



Erickson and Whittlesey [13] localized 1-dimensional classes with their shortest representative cycles. Their poly- 
nomial algorithm can only localize classes in the shortest homology basis, not arbitrary given classes. 
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Let the two end cycles of Tj be Zi and z' { , corresponding to x,- t and -<Xi, respectively. Let 
all the 2N end cycles of the N literal tubes have equal length. 




Figure 2: Left: A literal tube Tj is made fat and long yet with two short end cycles, Z{ and z[. 

Right: Example of shared edges. The clause cycle w\ corresponds to the clause 
Ci = (-1x3 V 2:5). The red, black and blue cycles are w\, z' 3 and 25, respectively. 
Although there is unwanted edge intersections in the figure, the simplicial complex 
can be studied without embedding, and thus, is well defined. 

(2) Clause Cycles and Shared Edges. For each clause Cj, we construct a 1-dimensional 
cycle Wi. All these clause cycles have equal length. For a clause Cj = (x a V Xf,), pick three 
edges of Wi as shared edges, ef , e\ and e°' b . Let cycle Wi and the end cycle z a share ef, Wi 
and Zb share e\. Let e°' b be shared by the three cycles, Wi, z a and z\ } . If Cj = (->x a V xj,), 
substitute the end cycle z' a for z a in the aforementioned sharing, and denote the three 

shared edges as ef , e\ and e" ' b . The case of (x a V ->Xb) and (->x a V ->Xb) can be addressed 
similarly. See Figure 2 (Right) for an example. 

We deliberately put five extra edges between the three shared edges of Wi. Within each 
end cycle, we put at least one extra edge between any two shared edges. These extra edges 
make sure that each vertex of the clause cycles and the end cycles belong to at most one 
shared edge. This way we can avoid discussions about the validation of the constructed 
simplicial complex, e.g. two edges sharing two vertices, etc. 

(3) Length of Cycles. For a clause cycle lOj, as shown in Figure 2 (Right), we have three 
shared edges and five extra edges. Its length is eight. For an end cycle Z{ (or z-), it shares 
two edges with each involved clause. Since xi (or -iXj) appears at most B times, z% (or 
z'j) has at most 2B shared edges. We put at least one extra edge between any two shared 
edges. Let each end cycle has 4B edges. 

Next, for the constructed simplicial complex, we ask Opt- Vol to localize the homology class 
h = hi + X^il^i] w hh the cycle with the smallest volume, which is, the shortest cycle. 

We may need a cycle to represent h as the input. The cycle zq = Yli=i z i + J2i=i w i wm work. 
Next, we show that there is a bijection between the set of boolean assignments of the literals and 
the set of representative cycles of h with no more than 4NB + 8M edges. Under this bijection, a 
boolean assignment satisfies k clauses if and only if its corresponding cycle has 4NB + 8M — 2k 
edges. 

It is not hard to see that localizing h is equivalent to finding a representative cycle for each 
hi and sum these cycles up with all the clause cycles, Wi. Recall that we make each tube fat and 
long such that a representative cycle of hi with no more than L edges can only be one of its end 
cycles, Zi or z\. Let the parameter L = 4NB + 8M. Then, any representative cycle of h with no 
more than 4NB + 8M edges can only be the sum of the M clause cycles and N end cycles, coming 
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from the ./V tubes, formally, Yli=i w i + Y2i=i %i 1S either z\ or z[. For a boolean assignment X, 
we construct the bijection such that in its corresponding cycle z, z% = Zi iff x% = True and 5j = z[ 
iff = False. 

For each boolean assignment, X, and the corresponding representative cycle, z. A clause Cj 
is satisfied by X if and only if one of the three shared edges of Wi appear in z. Ci is NOT satisfied 
if and only if all three shared edges of Wi appear. We can see this by enumerating all possibilities: 

(1) Ci is satisfied if and only if one of the following three cases happen. 

• x a = True and x = True. Wi, z a and z\, are all chosen. In this case, edge ef and e\ 

are canceled in z because of the mod 2 addition. Edge ef' is NOT because we count 
it three times; 

• x a = True and x = False. Wi and z a are chosen. z\> is NOT. Edge ef and ef' b are 
canceled. Edge e\ is NOT canceled. 

• x a = False and x = True. Similar to the second case. 

(2) Ci is NOT satisfied if and only if x a = False and x = False, which means z a and z are 
not chosen. Since Wi is always chosen, all the three shared edges are not canceled. 

In z, M clause cycles contribute 8M edges and N end cycles contribute 4NB edges. Consequently, 
k of the M clauses are satisfied by X if and only if 2k shared edges are canceled, that is, z has 
ANB + 8M - 2k edges. 

With this bijection, it is not hard to see that a cycle with the minimal number of edges 
uniquely corresponds to a boolean assignment satisfying the maximal number of clauses. ■ 

2.2. Diameter 

When it is NP-hard to compute z v , one may resort to the geodesic distance between elements 
of z, denoted as dist(p, q). As we will deal with a simplicial complex K, it is natural to introduce 
a discrete metric, that is, a length on each edge. Define dist(p, q) as the length of the shortest 
path connecting the two vertices p and q, in the 1-skeleton of K. 

The second choice of the objective function is the diameter. 

Definition 2.4 (Diameter). The diameter of a cycle is the diameter of its vertex set, diam(z) = 
diam(vert(z)), in which the diameter of a set of vertices is the maximal geodesic distance between 
them, formally, 

diam(S') = max dist (p, q) . 

p,qeS 

Intuitively, a representative cycle of h with the minimal diameter, denoted z^, is the cycle 
whose vertices are as close to each other as possible. The intuition will be further illustrated in 
Section 2.3 by comparison against other criteria. We prove in Theorem 2.6 that computing of 
h is NP-hard, by reduction from the NP-hard Multiple- Choice Cover Problem (MCCP) by Arkin 
and Hassin [2]. 

Problem 2.5 (Multiple- Choice Cover Problem). Given n vertices, V = {v±, V2, v n } in the 
Euclidean plane, E, let d(vi, Vj) be the Euclidean distance between any two vertices V{ and Vj. Let 
Si, S2, S m C V be disjoint subsets of V such that \J^ =1 Si = V. Find a cover CCV containing 
one and exactly one vertex from each subset Si, whose diameter, diam(C), is minimized. 4 

Theorem 2.6. Computing z^ for a given h is NP-hard. 

4 The original MCCP problem by Arkin and Hassin only requires the cover to have nonempty intersection with each 
subset Si. We revise the problem to facilitate our proof, without influencing the NP-hardness. The reason is the 
optimal result of the revised problem is definitely an optimal result of the original problem. 
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Proof. We present a polynomial algorithm transforming an input of MCCP into an input of the 
Opt-Diam problem, namely, computing for a given h. We will show that the solution of Opt- 
Diam gives us the solution of MCCP. As part of the input of Opt-Diam, the constructed simplicial 
complex K consists of m tubes, Ti,...,T m , as well as extra edges connecting vertices. 

For each vertex subset Si, we find a simple path in the Euclidean plane, E, going though each 
vertex of Si once without self-intersection, £j = (v i, t>2, v card rg.\), which contains card(S'j) — 1 
edges. The edge lengths are the same as the Euclidean distances between corresponding vertices in 
E. See Figure 3 (Left). We construct a a slender threadlike tube Tj, which carries the path T; L 




Figure 3: Left: An input of MCCP: 3 disjoint vertex subsets in Euclidean plane, Si, S2 and 
S3. The simple paths, £1, £ 2 and £3, are also shown, although they are not part of 
the input. 

Right: A triangular cylinder corresponding to the edge VjVj + i. 

has 3card(Si) vertices, SiUS^US'/, where S[ = {v[,v' 2 , -,^ ard(5 .)} and S'{ = {v'{, v%, -,^ d ( Sj )}- 
For any j, v'j and v'J live very close to Vj. Corresponding to the card(Si) — 1 edges in £j, Tj consists 
of card(S'i) — 1 triangular cylinder concatenated together. 5 Each edge VjVj+i corresponds to a 
triangular cylinder with vertices Vj, v'j, v'-, Vj + ±, v'j +1 and v" +l . In the triangular cylinder, the 
short edges are very short, say, no longer than e. The long edges have the length similar to the 
length of edge VjVj+i. See Figure 3 (Right) for one such triangular cylinder. 

We construct the simplicial complex, K, as following: For any i, Ti C K; For any two 
vertices v\, V2 £ V, if they are not neighbors, add an edge connecting them, whose length is their 
Euclidean distance in the Euclidean plane E. See Figure 4 for the complex constructed from the 
input in Figure 3 (Left). Please note that although in the figure, the embedding of K in M 3 has 
self-intersection, the simplicial complex K can be studied without embedding, and thus, is well 
defined. 




Figure 4: The constructed simplicial complex K. For simplicity, we only include new edges 
connecting vertices in S2 U S3. The rest edges are in K but omitted in this figure. 

For the constructed complex K, we ask Opt-Diam to localize the 1-dimensional class YlT^i hi, 
where hi is the only 1-dimensional class carried by the tube Tj. We need a cycle to represent it 

5 By a triangular cylinder we mean the surface of a 3-prism with the two end triangles missing. To facilitate the 
concatenation, corresponding edges of the end triangles may not be parallel to each other, as in a standard 3-prism. 
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as the input for Opt-Diam. We use zo = J27=i z io> where Zi is the 1-cycle whose vertices are Vi , 
v' iQ and v" . Vi is an arbitrary vertex in Si. 

Next, we construct a cover C from the solution of Opt-Diam, z, and show that C is the solution 
of MCCP. We construct an intermediate vertex set Co Q V as follows. A vertex v belongs to Co 
if and only if any of Vi, v\ and v" belongs to the vertex set of z, vert (2). The solution z is in the 
form YllLi z ii w here z\ represents class hi. Therefore, Co has nonempty intersection with each 
vertex set Si. We compute the cover C by picking one vertex from each Si (~l Co- 

Within the simplicial complex, diam(C) = diam(Co) and | diam(Co) — diam(z)| < 2e. Fur- 
thermore, C has the same diameter in the simplicial complex, if, and in the Euclidean plane, 
E. Since e is arbitrarily small, we can see that C is the cover with the minimal diameter in the 
Euclidean plane, and thus, is the solution of MCCP. ■ 



2.3. Radius 

This section needs more notations about the discrete geodesic distance and the geodesic balls. 
We define the discrete geodesic distance from a vertex p G vert(if), f p : vert(if) — ► Z, as follows. 
For any vertex q G vert (if), f p (q) = dist(p, q). For any simplex a £ K, f p (o~) is the maximal 
function value of the vertices of a, f p (cr) = max qevcIt ^ fp(o)- Finally, we define a geodesic ball 
Bp, p G vert (if), r > 0, as the subset of if , B p = {a G if | f p (o~) < r}. It is straightforward to 
show that these subsets are in fact subcomplexes. 

The third option of the objective function is the radius. 

Definition 2.7 (Radius). The radius of a cycle is the radius of the smallest geodesic ball carrying 
it, formally, 

rad(z) = min max dist(p, q), 

pevert(if) q£vevt(z) 

where vert (if ) and vert (2) are the sets of vertices of the given simplicial complex if and the cycle 
z, respectively. 

The representative cycle with the minimal radius, denoted as z r , is the same as the localized- 
cycle defined in the companion paper [7] . Intuitively, z r is the cycle whose vertices are as close to 
a vertex of if as possible. However, z r may not necessarily be concise enough in intuition. It may 
wiggle a lot while still being carried by the smallest geodesic ball carrying the class. See Figure 
5 (Left), in which we localize the only nontrivial homology class of an annulus (the light gray 
area). The dark gray area is the smallest geodesic ball carrying the class, whose center is p. Note 
that the geodesic ball of the annulus may not seems like a disc in the embedded Euclidean plane. 
Besides, the cycle with the minimal diameter (Figure 5 (Center)) avoids this wiggling problem 
and is concise in intuition. This in turn justifies the choice of diameter in Section 2. 2. 6 

In this section, we first show in Theorem 2.8 that z r is a 2-approximation of z^. Second, we 
show that z r can be computed in polynomial time. 

Theorem 2.8. diam(z r ) < 2diam(z f f). 

Proof. First, the triangle inequality of the geodesic distance suggests that for any two vertices of 
z r , pi andp2, their geodesic distance is dist(pi,2?2) < dist(pi,po) +dist(po>P2) < 2rad(z r ), where 
po is the center of the smallest geodesic ball carrying the cycle z r and the class. This implies that 
the diameter of z r is no greater than twice of its radius. 

Second, the diameter of Z& is no less than its radius. To see this, pick up a geodesic ball 
centered at any vertex of z& with radius diam(z^). This ball carries z&. Finally, diam(z r ) < 
2rad(z r ) < 2rad(z^) < 2diam(zd). ■ 



This figure also illustrates that the radius and the diameter of a cycle are not strictly related. For the cycle z r in 
the left, its diameter is twice of its radius. For the cycle z d in the center, its diameter is equal to its radius. 
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Figure 5: Left: the cycle with the minimal radius, z r . 

Center: the cycle with the minimal diameter, z<j. 

Right: Unstable Optimal Homology Basis. Used in Section 3.2 



This bound is a tight bound. In Figure 5, the cycle in the left is z r whose diameter is twice 
of the radius of the dark gray geodesic ball. The cycle in the center is whose diameter is the 
same as the radius of the ball. We have diam(z r ) = 2diam(zd). 

Theorem 2.9. We can compute z r in polynomial time. 

Proof. The proof is based on a polynomial algorithm. In the companion paper [7] , we computed 
z r for each class in the optimal homology class. Based on this algorithm, we devise a polynomial 
algorithm computing z r for an arbitrary class. Because of the space limitation, we put the 
algorithm in Appdendix A. ■ 



3. Stability Result. 

In this section, we prove that our measurement of homology in the companion paper [7] is 
stable: small changes of the geometry of the space imply small changes of our measurement. We 
define a change of the geometry of the space as a change of the metric in the space. We measure 
this change by measuring the Loo-norm difference of geodesic distance functions before and after 
the change. To facilitate the proof, we assume that during the change, the simplicial complex 
remains the same except for the discrete geodesic distance. Formally, we quantify the change of 
the geometry as 

e = max \fl - f^, (3.1) 

where f£ and f£ are the discrete geodesic distance functions before and after the change. 

Recall that in the companion paper, our measurement of homology involves two concepts, 
namely, the size of a class and the optimal homology basis, we defined the size of a homology 
class h as the radius of the smallest geodesic ball carrying h, formally, 

S(h)= min r f {h), 

where rf p (h) is the smallest value t such that the geodesic ball oo,t] carries h. A geodesic 

ball carried h if and only if it carries one of the cycles of h. Based on this size definition, we defined 
the optimal homology basis H as the set of (3 linearly independent homology classes whose size 
have the minimal sum, where (3 is the Betti number. Matroid theory suggests that H can be 
computed using a greedy algorithm: sort all the nontrivial classes in a monotonically increasing 
order according to their size. Check the smallest classes one by one. Pick each class that is not 
linearly dependent on what we have chosen, until (3 classes are collected. 
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In this section, we prove the stability of our measurement by showing that (1) for a single 
homology class, the size is stable; and (2) for the whole homology group, although the optimal 
homology basis is not stable, the group structure filtered by the size is stable. 

3.1. A Single Class 

For a single homology class, the size measure remains stable. Denote <S' 1 (/i) and S 2 (h) as the 
size of class h before and after the change (computed using and f 2 , respectively). We have the 
following theorem. 

Theorem 3.1. |-S' 1 (/i) — S 2 (h)\ < e, where e is the upperbound of the geometry change as defined 
in Equation (3.1). 

Proof. We show that for any specific vertex p, \rfi(h) — rfz(h)\ < e. This leads to the fact that 
= min pgvcrt (^-) rji{h) and S 2 {h) = min pevcrt (^) rp{h) differ in no more than e. 
For any vertex p, denote r\ and ri as rji(K) and Tj2(h) for short. According to the definition, 
they are the radii of the smallest geodesic balls carrying h computed using the geodesic distance 
fp and f 2 , respectively. For any simplex a in the ball Bp 1 calculated using f£, fp(cr) < r\, and 
thus f 2 (a) < fp(a) + e < n + e. This means that the ball Bp 1 calculated using f£ is a subcomplex 
of the ball Bp 1+e calculated using f 2 Therefore, according to the definition of ri, it is no greater 
than n + e. Similarly, we can see that n is no greater than r^ + e. ■ 

3.2. The Homology Group 

Since the size of different classes can be very close, the optimal homology basis is not stable. 
For example, in Figure 5 (Right), either {[zi], [zi], [23]} or {[zi], [zi\, [23] + [zi]} can be the optimal 
homology basis for little geometry changing, because the sizes of [23] and [z\\ + [23] are quite 
close. However, there is still some stability property in the homology group structure if we filter it 
with the class size. More specifically, the subgroup generated by small homology classes remains 
stable. For example, in Figure 5 (Right), although the optimal homology basis is unstable, the 
subgroup generated by the two smaller classes in the optimal homology basis will always be the 
one generated by [z±] and [22]. 

We formalize this stability by defining the subgroup filtration of a topological space and the 
distance between two such nitrations. A subgroup filtration is a sequence of subgroups of the 
homology group generated by subsets of the optimal homology basis filtered by the class size. A 
formal definition is as follows. 

Definition 3.2 (Subgroup Filtration). Given an optimal homology basis TC = {hi, h,2, hp}, 
where we assume S(hi) < S(hi + i), a subgroup filtration is a sequence of subgroups of the homology 
group, X = {tpo, ipi,il>2, V 7 /?}) where ^ = span(/ii, /12, hi) is the subgroup generated 7 by the 
classes h\, hi, h, L . 

Obviously, the subgroup filtration is a sequence of subgroups of H(ii") with a nested structure 
ipi Q i'i+i-, ^o = and ipp = H(i^T). For convenience, we denote the size of a subgroup, ipi, as 
the size of the largest class in the optimal homology basis generating fy, formally, S(tpi) = S(hi). 
Please note that S(ipi) is not the size of the largest class in ipi. Next, we define the distance function 
between two subgroup nitrations X 1 and X 2 , which requires the definition of the projection of 
one subgroup in one filtration onto the other filtration. 



Since here the coefficient ring R = Z2, the homology group and all its subgroups are vector spaces. Therefore, we 
use the notation ipi — span(/ii, h 2 , hi) when we say hi, h 2 , hi generates ipi. 
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Definition 3.3 (Projection). Given two subgroup nitrations of a same homology group X 1 = 
{ipQ, Y>2) ■•■) ipp} and X 2 = {ip 2 ., ipf, V'l' define the projection of ^ onto X 2 as the first 

subgroup in X 2 that carries tpj, formally, 

projOh A' 2 ) = V, 2 , s.t.j = min fc. 

Definition 3.4 (Distance). Define the distance between X 1 and X 2 as the maximal difference 
between the sizes of any subgroup in X 1 or X 2 and its projection onto the other filtration, formally, 

dist(*\ X 2 ) = maxImaxIS 1 ^ 1 ) - S 2 (proj(^, X 2 ))\, max \S 2 ^ 2 ) - S\ W o]^l X 1 )^}. 

Let X 1 and X 2 be the subgroup nitrations of the original space and the one after the change. 
We can prove the following stability result. 



Theorem 3.5. 



dist(X\X 2 ) <e= max I/ 1 - f 2 ^. 

p<=vcrt(K) 



To facilitate the proof, we state the following two facts about any optimal homology basis 
TL = {hi,fi2,--- ,h/3} (sorted according to their size), which follows the greedy algorithm of 
computing TL. 

• Fact 1: Any class in the basis, hj G TL, cannot be linearly dependent on classes whose 
size are smaller than S(hj); 

• Fact 2: For any homology class h and real number so, S(h) < sq implies that there exists 
a k < (3, such that h G span(/ii, /12, • • • , /ifc) and S(hk) < so- 

With these facts, we prove Theorem 3.5 as follows. 

Proof. It suffices to show that for any subgroup ipj, — S 2 (ip 2 )\ < e, where ip 2 is the 

projection, proj (?/>*, X 2 ). Denote the optimal homology bases before and after the change as 
TL 1 = {h\, h\, h\} and TL 2 = {h 2 , h 2 ,, /i^}, both of which are sorted by the class size. We 
know that ipj = span(/i}, h\, h\) and ip 2 = span(/i 2 , h\, h 2 ). We will use the following two 
facts which follows the construction of the optimal homology basis. 
We first prove by contradiction that 

S 2 ^ 2 )>S\ipj)-e. (3.2) 

Suppose S 2 (ip 2 ) < S 1 ^}) — e. Any class in the set {h 2 , h 2 ,,--- , h 2 } has its S 2 size smaller than 
5 x (/i|) — e, and thus, according to Theorem 3.1 has its S* 1 size smaller than S ,1 (/i t 1 ). Based on the 
definition of the projection, tpj C tpj. The class h\ is linearly dependent on classes hf, h 2 ,, ■ ■ ■ ,h 2 , 

all of which have their S 1 size smaller than S l (hj). This contradicts Fact 1. Equation (3.2) is 
proved. 

Second, we show that 

S 2 ^ 2 )<S\i,\) + e. (3.3) 
For any I < i, S 1 ^) < S 1 (hj). According to Theorem 3.1, 

S\hj) < S\hj) + e < S\h\) + e = S\ti) + e. 

Based on Fact 2, for all such /, there exists k, such that hj £ span(/i 2 , h 2 ., • • • , h\) and S 2 (ip 2 ) = 
S 2 {h\) < S 1 ^}) + e. Therefore, ipj C ip 2 . Based on the definition of the projection, %j) 2 has its 
size no greater than Equation (3.3) is proved. ■ 
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4. Future Problems 

We found the following related problems to be interesting and deserving further study. 

(1) We prove that localizing any given class with the cycle with the minimal volume is NP- 
hard. However, notice that Erickson and Whittlesey [13] showed in low dimension that we can 
localize the shortest homology basis in polynomial time. An interesting question is then can we 
in general localize the basis with the minimal volume sum in polynomial time? What about the 
diameter? 

(2) We restricted the stability proof to the case when the geometry change is a homeomor- 
phism (the simplicial complex remains unchanged). Is it possible to extend this to incorporate 
topological changes? More specifically, is the measurement still stable even if small homology 
classes are destroyed and created during the change? 
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Appendix A. A Polynomial Algorithm to Compute z r , the Cycle with the Min- 
imal Radius 

In the companion paper, a polynomial algorithm is provided to compute z r for classes in the 
optimal homology basis. We generalize this algorithm to compute z r for any given homology class. 
Please refer to [7] for the correctness of this algorithm. 

First, we need the smallest geodesic ball carrying the class we want to localize. The procedure 
Bmin(i^,2;o) in Algorithm 1 computes this ball, which uses the procedure Contain-Cycle(i^,il"o,2;o) 
(See Algorithm 2) to detect whether a subcomplex Kq carries any cycle of the class to localize. 

Algorithm 1 Bm\n(K,z ) 

Goal: Find the smallest geodesic ball, B m i n , carrying any cycle of h. 
Input: K: the given simplicial complex. 

zq: a cycle representing h. 
Output: Pmin and r m j n : the center and radius of the computed ball B m ; m . 

1: for p e vert(if) do 

2: r=0 

3: repeat 

4: r=r+l 

5: until Contain-Cycle(K ,B^,z ) 
6: if r < r m in then 

I ■ 1 mm — ' 

8: Pmin = P 

9: end if 
10: end for 



Algorithm 2 Contain-Cycle^-ffo^o) 

Goal: test whether Kq carries any representative cycle of the class h = [zq\. 
Input: K: the given simplicial complex. 

Kq: the subcomplex. 

zq: a cycle representing h. 
Output: Boolean. 
1: Z d = [d d+1 ,z ] 

2: compute d^^° and Z^ K ° by picking up rows of <9<f+i and Z d whose corresponding simplices 

do not belong to Kq 
3: if rank(zf Wo ) - rank^f ) + 1 then 
4: return true 
5: else 

6: return false 
7: end if 



Second, when B m i n is computed, we compute a cycle of h carried by B m i n as follows. Solve 
the equation system <9 ( ^ | _\ B "" n 7 + z ^\ Bmin = o, where d*^f min is the matrix formed by the rows 

of dd+i whose corresponding simplices do not belong to B m i n , z ^\ Brnm [ s the vector formed by 
the entries of zq whose corresponding simplices do not belong to B m i n . Using the solution 7, 
zq + <9(i + i7 is a cycle of h carreid by B m , and thus is the desired z r . 



